Quit Emailing Yourself

# image editing → multimodal

1 link tagged with all of: image editing + multimodal

[2510.19808] Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing

The article introduces the Pico-Banana-400K dataset, a large-scale collection of 400,000 images designed for text-guided image editing. It aims to address the limitations in existing datasets by providing high-quality, diverse edit pairs generated from real photographs, facilitating advanced research in multimodal image editing techniques. The dataset includes specialized subsets for multi-turn editing, preference research, and instruction summarization.

Saved by hn_user_1 · 2 others saved this · Last saved October 28, 2025 · 3 min read

+ dataset image editing ✓ multimodal ✓

Links

[2510.19808] Pico-Banana-400K: A Large-Scale Dataset for Text-Guided Image Editing